Results 1 - 8 of 8
1.
PLoS One ; 4(3): e4688, 2009.
Article in English | MEDLINE | ID: mdl-19259274

ABSTRACT

The identification of regulatory sequences in animal genomes remains a significant challenge. Comparative genomic methods that use patterns of evolutionary conservation to identify non-coding sequences with regulatory function have yielded many new vertebrate enhancers. However, these methods have not contributed significantly to the identification of regulatory sequences in sequenced invertebrate taxa. We demonstrate here that this differential success, which is often attributed to fundamental differences in the nature of vertebrate and invertebrate regulatory sequences, is instead primarily a product of the relatively small size of sequenced invertebrate genomes. We sequenced and compared loci involved in early embryonic patterning from four species of true fruit flies (family Tephritidae) that have genomes four to six times larger than those of Drosophila melanogaster. Unlike in Drosophila, where virtually all non-coding DNA is highly conserved, blocks of conserved non-coding sequence in tephritids are flanked by large stretches of poorly conserved sequence, similar to what is observed in vertebrate genomes. We tested the activities of nine conserved non-coding sequences flanking the even-skipped gene of the tephritid Ceratitis capitata in transgenic D. melanogaster embryos, six of which drove patterns that recapitulate those of known D. melanogaster enhancers. In contrast, none of the three non-conserved tephritid non-coding sequences that we tested drove expression in D. melanogaster embryos. Based on the landscape of non-coding conservation in tephritids, and our initial success in using conservation in tephritids to identify D. melanogaster regulatory sequences, we suggest that comparison of tephritid genomes may provide a systematic means to annotate the non-coding portion of the D. melanogaster genome.
We also propose that large genomes be given more consideration in the selection of species for comparative genomics projects, to provide increased power to detect functional non-coding DNAs and to provide a less biased view of the evolution and function of animal genomes.


Subjects
Genome , Regulatory Sequences, Nucleic Acid , Animals , Databases, Genetic , Species Specificity
2.
PLoS Genet ; 4(6): e1000106, 2008 Jun 27.
Article in English | MEDLINE | ID: mdl-18584029

ABSTRACT

The gene expression pattern specified by an animal regulatory sequence is generally viewed as arising from the particular arrangement of transcription factor binding sites it contains. However, we demonstrate here that regulatory sequences whose binding sites have been almost completely rearranged can still produce identical outputs. We sequenced the even-skipped locus from six species of scavenger flies (Sepsidae) that are highly diverged from the model species Drosophila melanogaster, but share its basic patterns of developmental gene expression. Although there is little sequence similarity between the sepsid eve enhancers and their well-characterized D. melanogaster counterparts, the sepsid and Drosophila enhancers drive nearly identical expression patterns in transgenic D. melanogaster embryos. We conclude that the molecular machinery that connects regulatory sequences to the transcription apparatus is more flexible than previously appreciated. In exploring this diverse collection of sequences to identify the shared features that account for their similar functions, we found a small number of short (20-30 bp) sequences nearly perfectly conserved among the species. These highly conserved sequences are strongly enriched for pairs of overlapping or adjacent binding sites. Together, these observations suggest that the local arrangement of binding sites relative to each other is more important than their overall arrangement into larger units of cis-regulatory function.


Subjects
Diptera/genetics , Drosophila Proteins/genetics , Drosophila/genetics , Enhancer Elements, Genetic , Evolution, Molecular , Homeodomain Proteins/genetics , Insect Proteins/genetics , Transcription Factors/genetics , Animals , Animals, Genetically Modified/classification , Animals, Genetically Modified/embryology , Animals, Genetically Modified/genetics , Animals, Genetically Modified/metabolism , Base Sequence , Binding Sites , Conserved Sequence , Diptera/classification , Diptera/metabolism , Drosophila/classification , Drosophila/embryology , Drosophila/metabolism , Drosophila Proteins/metabolism , Embryo, Nonmammalian/metabolism , Gene Expression Regulation, Developmental , Genome, Insect , Homeodomain Proteins/metabolism , Insect Proteins/metabolism , Sequence Alignment , Transcription Factors/metabolism
3.
PLoS Biol ; 6(2): e27, 2008 Feb.
Article in English | MEDLINE | ID: mdl-18271625

ABSTRACT

Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. We used whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over 40 well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. 
Together these observations suggest that many of these poorly bound regions are not involved in early-embryonic transcriptional regulation, and a significant proportion may be nonfunctional. Surprisingly, for five of the six factors, their recognition sites are not unambiguously more constrained evolutionarily than the immediate flanking DNA, even in more highly bound and presumably functional regions, indicating that comparative DNA sequence analysis is limited in its ability to identify functional transcription factor targets.


Subjects
Blastoderm/metabolism , Drosophila melanogaster/embryology , Transcription Factors/metabolism , Animals , Binding Sites , DNA/metabolism , Evolution, Molecular , MicroRNAs/metabolism
4.
Nature ; 450(7167): 203-18, 2007 Nov 08.
Article in English | MEDLINE | ID: mdl-17994087

ABSTRACT

Comparative analysis of multiple genomes in a phylogenetic framework dramatically improves the precision and sensitivity of evolutionary inference, producing more robust results than single-genome analyses can provide. The genomes of 12 Drosophila species, ten of which are presented here for the first time (sechellia, simulans, yakuba, erecta, ananassae, persimilis, willistoni, mojavensis, virilis and grimshawi), illustrate how rates and patterns of sequence divergence across taxa can illuminate evolutionary processes on a genomic scale. These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution. Despite remarkable similarities among these Drosophila species, we identified many putatively non-neutral changes in protein-coding genes, non-coding RNA genes, and cis-regulatory regions. These may prove to underlie differences in the ecology and behaviour of these diverse species.


Subjects
Drosophila/classification , Drosophila/genetics , Evolution, Molecular , Genes, Insect/genetics , Genome, Insect/genetics , Genomics , Phylogeny , Animals , Codon/genetics , DNA Transposable Elements/genetics , Drosophila/immunology , Drosophila/metabolism , Drosophila Proteins/genetics , Gene Order/genetics , Genome, Mitochondrial/genetics , Immunity/genetics , Multigene Family/genetics , RNA, Untranslated/genetics , Reproduction/genetics , Sequence Alignment , Sequence Analysis, DNA , Synteny/genetics
5.
PLoS Genet ; 2(10): e173, 2006 Oct 27.
Article in English | MEDLINE | ID: mdl-17132051

ABSTRACT

The phylogenetic relationship of the now fully sequenced species Drosophila erecta and D. yakuba with respect to the D. melanogaster species complex has been a subject of controversy. All three possible groupings of the species have been reported in the past, though recent multi-gene studies suggest that D. erecta and D. yakuba are sister species. Using the whole genomes of each of these species as well as the four other fully sequenced species in the subgenus Sophophora, we set out to investigate the placement of D. erecta and D. yakuba in the D. melanogaster species group and to understand the cause of the past incongruence. Though we find that the phylogeny grouping D. erecta and D. yakuba together is the best supported, we also find widespread incongruence in nucleotide and amino acid substitutions, insertions and deletions, and gene trees. The time inferred to span the two key speciation events is short enough that under the coalescent model, the incongruence could be the result of incomplete lineage sorting. Consistent with the lineage-sorting hypothesis, substitutions supporting the same tree were spatially clustered. Support for the different trees was found to be linked to recombination such that adjacent genes support the same tree most often in regions of low recombination and substitutions supporting the same tree are most enriched roughly on the same scale as linkage disequilibrium, also consistent with lineage sorting. The incongruence was found to be statistically significant and robust to model and species choice. No systematic biases were found. We conclude that phylogenetic incongruence in the D. melanogaster species complex is the result, at least in part, of incomplete lineage sorting. Incomplete lineage sorting will likely cause phylogenetic incongruence in many comparative genomics datasets. 
Methods to infer the correct species tree, the history of every base in the genome, and comparative methods that control for and/or utilize this information will be valuable advancements for the field of comparative genomics.
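The kind of site-level incongruence described in this abstract can be illustrated with a minimal sketch. It assumes four pre-aligned sequences of equal length (three ingroup species plus an outgroup) and counts substitutions that support each of the three possible groupings by simple parsimony. The function name and the toy sequences are hypothetical; the paper's own analysis is far more extensive and is not reproduced here.

```python
from collections import Counter

def count_tree_support(mel, ere, yak, out):
    """Count aligned sites supporting each of the three possible groupings.

    A site supports a grouping when two ingroup species share one base
    while the third ingroup species shares a different base with the outgroup.
    """
    support = Counter()
    for m, e, y, o in zip(mel, ere, yak, out):
        if "-" in (m, e, y, o) or "N" in (m, e, y, o):
            continue  # skip gapped or ambiguous columns
        if e == y and m == o and e != m:
            support["(ere,yak)"] += 1
        elif m == e and y == o and m != y:
            support["(mel,ere)"] += 1
        elif m == y and e == o and m != e:
            support["(mel,yak)"] += 1
    return support

# Toy alignment (hypothetical data): one site supports each grouping.
support = count_tree_support("ACGTA", "ATGCA", "ATGTC", "ACGCC")
```

Under incomplete lineage sorting, all three counts would be expected to be non-zero across a genome, with the species-tree grouping receiving the most support.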


Subjects
Drosophila/classification , Drosophila/genetics , Genes, Insect/genetics , Genetic Speciation , Phylogeny , Animals , Base Sequence , Cluster Analysis , Drosophila melanogaster/genetics , Evolution, Molecular , Likelihood Functions , Models, Genetic
6.
PLoS Comput Biol ; 2(10): e130, 2006 Oct.
Article in English | MEDLINE | ID: mdl-17040121

ABSTRACT

The gain and loss of functional transcription factor binding sites has been proposed as a major source of evolutionary change in cis-regulatory DNA and gene expression. We have developed an evolutionary model to study binding-site turnover that uses multiple sequence alignments to assess the evolutionary constraint on individual binding sites, and to map gain and loss events along a phylogenetic tree. We apply this model to study the evolutionary dynamics of binding sites of the Drosophila melanogaster transcription factor Zeste, using genome-wide in vivo (ChIP-chip) binding data to identify functional Zeste binding sites, and the genome sequences of D. melanogaster, D. simulans, D. erecta, and D. yakuba to study their evolution. We estimate that more than 5% of functional Zeste binding sites in D. melanogaster were gained along the D. melanogaster lineage or lost along one of the other lineages. We find that Zeste-bound regions have a reduced rate of binding-site loss and an increased rate of binding-site gain relative to flanking sequences. Finally, we show that binding-site gains and losses are asymmetrically distributed with respect to D. melanogaster, consistent with lineage-specific acquisition and loss of Zeste-responsive regulatory elements.


Subjects
Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Response Elements/genetics , Transcription Factors/metabolism , Animals , Base Sequence , Binding Sites , Computational Biology , Conserved Sequence , DNA, Intergenic/genetics , DNA-Binding Proteins/genetics , Drosophila Proteins/genetics , Evolution, Molecular , Models, Genetic , Molecular Sequence Data , Promoter Regions, Genetic/genetics , Selection, Genetic , Sequence Alignment
7.
BMC Bioinformatics ; 7: 376, 2006 Aug 14.
Article in English | MEDLINE | ID: mdl-16904011

ABSTRACT

BACKGROUND: Molecular evolutionary studies of noncoding sequences rely on multiple alignments. Yet how multiple alignment accuracy varies across sequence types, tree topologies, divergences and tools, and further how this variation impacts specific inferences, remains unclear.

RESULTS: Here we develop a molecular evolution simulation platform, CisEvolver, with models of background noncoding and transcription factor binding site evolution, and use simulated alignments to systematically examine multiple alignment accuracy and its impact on two key molecular evolutionary inferences: transcription factor binding site conservation and divergence estimation. We find that the accuracy of multiple alignments is determined almost exclusively by the pairwise divergence distance of the two most diverged species and that additional species have a negligible influence on alignment accuracy. Conserved transcription factor binding sites align better than surrounding noncoding DNA yet are often found to be misaligned at relatively short divergence distances, such that studies of binding site gain and loss could easily be confounded by alignment error. Divergence estimates from multiple alignments tend to be overestimated at short divergence distances but reach a tool-specific divergence at which they cease to increase, leading to underestimation at long divergences. Our most striking finding was that overall alignment accuracy, binding site alignment accuracy and divergence estimation accuracy vary greatly across branches in a tree and are most accurate for terminal branches connecting sister taxa and least accurate for internal branches connecting sub-alignments.

CONCLUSION: Our results suggest that variation in alignment accuracy can lead to errors in molecular evolutionary inferences that could be construed as biological variation. These findings have implications for which species to choose for analyses, what kind of errors would be expected for a given set of species and how multiple alignment tools and phylogenetic inference methods might be improved to minimize or control for alignment errors.
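When the true alignment is known from simulation, alignment accuracy can be scored directly. The abstract does not state which accuracy measure was used, so the sketch below assumes a common residue-pair score: the fraction of residue pairs aligned in the true alignment that the tested alignment also places in the same column. The function names and example sequences are hypothetical.

```python
def aligned_pairs(row_a, row_b):
    """Return the set of (i, j) residue-index pairs that share a column."""
    pairs = set()
    i = j = 0  # positions in the ungapped sequences
    for a, b in zip(row_a, row_b):
        if a != "-" and b != "-":
            pairs.add((i, j))
        if a != "-":
            i += 1
        if b != "-":
            j += 1
    return pairs

def pairwise_accuracy(true_aln, test_aln):
    """Fraction of truly aligned residue pairs recovered by test_aln."""
    true_pairs = aligned_pairs(*true_aln)
    test_pairs = aligned_pairs(*test_aln)
    if not true_pairs:
        return 0.0  # nothing to recover in the reference
    return len(true_pairs & test_pairs) / len(true_pairs)

# Hypothetical example: the same two sequences aligned with and without gaps.
true_aln = ("ACGT-A", "AC-TGA")
test_aln = ("ACGTA", "ACTGA")
accuracy = pairwise_accuracy(true_aln, test_aln)
```

For a multiple alignment, the same score can be computed for each pair of rows, which makes it possible to compare accuracy between sister taxa and more distant pairs.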


Subjects
Evolution, Molecular , Models, Genetic , Regulatory Elements, Transcriptional/genetics , Sequence Alignment/methods , Binding Sites/genetics , Sequence Analysis, DNA/methods , Species Specificity
8.
Genome Biol ; 5(12): R98, 2004.
Article in English | MEDLINE | ID: mdl-15575972

ABSTRACT

We introduce a method (MONKEY) to identify conserved transcription-factor binding sites in multispecies alignments. MONKEY employs probabilistic models of factor specificity and binding-site evolution, on which basis we compute the likelihood that putative sites are conserved and assign statistical significance to each hit. Using genomes from the genus Saccharomyces, we illustrate how the significance of real sites increases with evolutionary distance and explore the relationship between conservation and function.
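The general idea of looking for binding sites that are conserved across an alignment can be sketched as follows. This is not MONKEY's algorithm: MONKEY uses a probabilistic model of binding-site evolution and assigns statistical significance, whereas this simplified stand-in scores each species independently with a position weight matrix and reports columns where every species exceeds a threshold. The matrix, background frequencies, threshold values and sequences are all hypothetical, and the input is assumed to contain only A, C, G, T and gap characters.

```python
import math

# Hypothetical 4-position base-frequency matrix favouring "ACGT" (not from the paper).
PWM = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
]
# Assumed uniform background base frequencies.
BACKGROUND = {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25}

def log_odds(site):
    """Log2 likelihood ratio of a site under the matrix versus background."""
    return sum(math.log2(PWM[i][b] / BACKGROUND[b]) for i, b in enumerate(site))

def conserved_hits(aligned_rows, threshold):
    """Start columns where every species has an ungapped site scoring >= threshold."""
    width = len(PWM)
    hits = []
    for start in range(len(aligned_rows[0]) - width + 1):
        windows = [row[start:start + width] for row in aligned_rows]
        if any("-" in w for w in windows):
            continue  # this sketch ignores sites interrupted by gaps
        if all(log_odds(w) >= threshold for w in windows):
            hits.append(start)
    return hits

# Hypothetical aligned sequences from three species.
rows = ["TTACGTTT", "GGACGTCC", "CCACGAAA"]
strict = conserved_hits(rows, 5.0)   # third species falls below the cutoff
relaxed = conserved_hits(rows, 3.0)  # all three species pass
```

A fixed per-species threshold like this treats all species as independent observations; accounting for phylogenetic relatedness is the gap that a model-based method is designed to close.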


Subjects
Biological Evolution , Conserved Sequence , DNA/metabolism , Sequence Alignment , Software , Transcription Factors/metabolism , Binding Sites , DNA, Fungal/metabolism , DNA-Binding Proteins/metabolism , Gene Expression Regulation , Models, Genetic , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/metabolism , Species Specificity